Overview

Dataset statistics

Number of variables: 25
Number of observations: 4250
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 626.8 KiB
Average record size in memory: 151.0 B
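
The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming the report divides total memory by the number of observations (the function name is hypothetical):

```python
KIB = 1024  # bytes per kibibyte


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the mean number of bytes held in memory per row."""
    return total_kib * KIB / n_rows


# Figures taken from the dataset statistics above.
avg_bytes = average_record_size(626.8, 4250)
print(f"{avg_bytes:.1f} B")
```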

Variable types

Numeric: 19
Categorical: 6

Alerts

total_day_minutes is highly correlated with total_day_charge (High correlation)
total_day_calls is highly correlated with TotalDaychargePerCall (High correlation)
total_day_charge is highly correlated with total_day_minutes (High correlation)
total_eve_minutes is highly correlated with total_eve_charge (High correlation)
total_eve_calls is highly correlated with TotalEvechargePerCall (High correlation)
total_eve_charge is highly correlated with total_eve_minutes (High correlation)
total_night_minutes is highly correlated with total_night_charge (High correlation)
total_night_calls is highly correlated with TotalNightchargePercall (High correlation)
total_night_charge is highly correlated with total_night_minutes (High correlation)
total_intl_minutes is highly correlated with total_intl_charge (High correlation)
total_intl_calls is highly correlated with TotalIntnlchargePerCall (High correlation)
total_intl_charge is highly correlated with total_intl_minutes (High correlation)
area_code_area_code_408 is highly correlated with area_code_area_code_415 (High correlation)
area_code_area_code_415 is highly correlated with area_code_area_code_408 and 1 other field (High correlation)
area_code_area_code_510 is highly correlated with area_code_area_code_415 (High correlation)
TotalDaychargePerCall is highly correlated with total_day_calls (High correlation)
TotalNightchargePercall is highly correlated with total_night_calls (High correlation)
TotalEvechargePerCall is highly correlated with total_eve_calls (High correlation)
TotalIntnlchargePerCall is highly correlated with total_intl_calls (High correlation)
total_day_minutes is highly correlated with total_day_charge (High correlation)
total_day_calls is highly correlated with TotalDaychargePerCall (High correlation)
total_day_charge is highly correlated with total_day_minutes (High correlation)
total_eve_minutes is highly correlated with total_eve_charge (High correlation)
total_eve_calls is highly correlated with TotalEvechargePerCall (High correlation)
total_eve_charge is highly correlated with total_eve_minutes (High correlation)
total_night_minutes is highly correlated with total_night_calls and 2 other fields (High correlation)
total_night_calls is highly correlated with total_night_minutes and 2 other fields (High correlation)
total_night_charge is highly correlated with total_night_minutes and 2 other fields (High correlation)
total_intl_minutes is highly correlated with total_intl_charge (High correlation)
total_intl_calls is highly correlated with TotalIntnlchargePerCall (High correlation)
total_intl_charge is highly correlated with total_intl_minutes (High correlation)
area_code_area_code_408 is highly correlated with area_code_area_code_415 and 1 other field (High correlation)
area_code_area_code_415 is highly correlated with area_code_area_code_408 and 1 other field (High correlation)
area_code_area_code_510 is highly correlated with area_code_area_code_408 and 1 other field (High correlation)
TotalDaychargePerCall is highly correlated with total_day_calls (High correlation)
TotalNightchargePercall is highly correlated with total_night_minutes and 2 other fields (High correlation)
TotalEvechargePerCall is highly correlated with total_eve_calls (High correlation)
TotalIntnlchargePerCall is highly correlated with total_intl_calls (High correlation)
state has 61 (1.4%) zeros (Zeros)
number_customer_service_calls has 886 (20.8%) zeros (Zeros)
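
The percentages in the alerts and frequency tables are shares of the 4250 observations. The sketch below reproduces that formatting under an assumption inferred from the values in this report, not from the tool's source: shares are rounded to one decimal, and a non-zero count that rounds to 0.0 is shown as "< 0.1%". The function name is hypothetical.

```python
N_ROWS = 4250  # number of observations reported in the overview


def format_share(count: int, total: int = N_ROWS) -> str:
    """Format count/total as a one-decimal percentage string."""
    rounded = f"{100 * count / total:.1f}"
    # Assumed rule: a non-zero count must not be displayed as 0.0%.
    if count > 0 and rounded == "0.0":
        return "< 0.1%"
    return f"{rounded}%"


print(format_share(886))  # zeros in number_customer_service_calls
print(format_share(61))   # zeros in state
print(format_share(2))    # a value that occurs only twice
```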

Reproduction

Analysis started: 2021-11-02 19:21:29.196062
Analysis finished: 2021-11-02 19:23:06.941646
Duration: 1 minute and 37.75 seconds
Software version: pandas-profiling v3.1.1
Download configuration: config.json
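
The reported duration can be checked against the two timestamps. A small sketch using only the standard library (the output wording is illustrative and not the report's own formatter):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
started = datetime.fromisoformat("2021-11-02 19:21:29.196062")
finished = datetime.fromisoformat("2021-11-02 19:23:06.941646")

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```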

Variables

state
Real number (ℝ≥0)

ZEROS

Distinct: 51
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.09411765
Minimum: 0
Maximum: 50
Zeros: 61
Zeros (%): 1.4%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 14
median: 26
Q3: 39
95-th percentile: 49
Maximum: 50
Range: 50
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 14.76904934
Coefficient of variation (CV): 0.565991521
Kurtosis: -1.186782745
Mean: 26.09411765
Median Absolute Deviation (MAD): 13
Skewness: -0.05908461521
Sum: 110900
Variance: 218.1248183
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
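
Several of the statistics above are derived from the others. The sketch below recomputes them for state from the reported values, using the standard definitions (IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared). The variable names are illustrative.

```python
# Values copied from the state statistics above.
minimum, maximum = 0, 50
q1, q3 = 14, 39
mean, std = 26.09411765, 14.76904934

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(value_range, iqr, round(cv, 6), round(variance, 4))
```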
Common values

Value  Count  Frequency (%)
49     139    3.3%
23     108    2.5%
13     106    2.5%
1      101    2.4%
45     100    2.4%
37     99     2.3%
43     98     2.3%
44     97     2.3%
34     96     2.3%
31     96     2.3%
Other values (41)  3210  75.5%

Minimum 10 values

Value  Count  Frequency (%)
0      61     1.4%
1      101    2.4%
2      71     1.7%
3      77     1.8%
4      39     0.9%
5      80     1.9%
6      88     2.1%
7      72     1.7%
8      80     1.9%
9      76     1.8%

Maximum 10 values

Value  Count  Frequency (%)
50     95     2.2%
49     139    3.3%
48     94     2.2%
47     80     1.9%
46     86     2.0%
45     100    2.4%
44     97     2.3%
43     98     2.3%
42     79     1.9%
41     75     1.8%

account_length
Real number (ℝ≥0)

Distinct: 215
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.2362353
Minimum: 1
Maximum: 243
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 33.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 35.45
Q1: 73
median: 100
Q3: 127
95-th percentile: 167
Maximum: 243
Range: 242
Interquartile range (IQR): 54

Descriptive statistics

Standard deviation: 39.69840057
Coefficient of variation (CV): 0.3960483996
Kurtosis: -0.1321747749
Mean: 100.2362353
Median Absolute Deviation (MAD): 27
Skewness: 0.1223273244
Sum: 426004
Variance: 1575.963008
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value  Count  Frequency (%)
90     53     1.2%
87     51     1.2%
93     50     1.2%
100    48     1.1%
120    48     1.1%
105    48     1.1%
116    47     1.1%
98     47     1.1%
127    47     1.1%
112    46     1.1%
Other values (205)  3765  88.6%

Minimum 10 values

Value  Count  Frequency (%)
1      7      0.2%
2      2      < 0.1%
3      7      0.2%
4      2      < 0.1%
5      2      < 0.1%
6      2      < 0.1%
7      5      0.1%
8      1      < 0.1%
9      2      < 0.1%
10     3      0.1%

Maximum 10 values

Value  Count  Frequency (%)
243    1      < 0.1%
232    2      < 0.1%
225    2      < 0.1%
224    2      < 0.1%
222    2      < 0.1%
221    1      < 0.1%
217    3      0.1%
216    1      < 0.1%
215    1      < 0.1%
212    1      < 0.1%

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      3854
1      396

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
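
As an illustration of those character properties, the sketch below looks up the general category of each character that occurs in this variable, using Python's standard `unicodedata` module. The code "Nd" is the category the report labels "Decimal Number". Script and block lookups are not available in the standard library, so they are left out here.

```python
import unicodedata

# The two distinct values observed for this variable.
values = ["0", "1"]

# Map every character that occurs to its Unicode general category.
categories = {ch: unicodedata.category(ch) for value in values for ch in value}

for ch, cat in categories.items():
    print(ch, unicodedata.name(ch), cat)

distinct_categories = len(set(categories.values()))
```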

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 1
4th row: 1
5th row: 0

Common Values

Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

Length

Histogram of lengths of the category

Pie chart

Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

Most occurring characters

Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  4250   100.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

Most occurring scripts

Value   Count  Frequency (%)
Common  4250   100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  4250   100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      3854   90.7%
1      396    9.3%

voice_mail_plan
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      3138
1      1112

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

Most occurring characters

ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4250
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

Most occurring scripts

ValueCountFrequency (%)
Common4250
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII4250
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
03138
73.8%
11112
 
26.2%

total_day_minutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1843
Distinct (%)43.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean180.2596
Minimum0
Maximum351.5
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile91.59
Q1143.325
median180.45
Q3216.2
95-th percentile271.055
Maximum351.5
Range351.5
Interquartile range (IQR)72.875

Descriptive statistics

Standard deviation54.01237333
Coefficient of variation (CV)0.2996365982
Kurtosis-0.05670971637
Mean180.2596
Median Absolute Deviation (MAD)36.6
Skewness-0.006910229801
Sum766103.3
Variance2917.336473
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
189.310
 
0.2%
1809
 
0.2%
184.58
 
0.2%
1548
 
0.2%
177.18
 
0.2%
168.67
 
0.2%
230.77
 
0.2%
183.67
 
0.2%
1977
 
0.2%
1857
 
0.2%
Other values (1833)4172
98.2%
ValueCountFrequency (%)
02
< 0.1%
2.61
< 0.1%
6.61
< 0.1%
7.21
< 0.1%
7.81
< 0.1%
7.91
< 0.1%
25.91
< 0.1%
271
< 0.1%
29.91
< 0.1%
30.91
< 0.1%
ValueCountFrequency (%)
351.51
< 0.1%
346.81
< 0.1%
345.31
< 0.1%
338.41
< 0.1%
337.41
< 0.1%
335.51
< 0.1%
334.31
< 0.1%
332.91
< 0.1%
332.11
< 0.1%
329.81
< 0.1%

total_day_calls
Real number (ℝ≥0)

HIGH CORRELATION

Distinct120
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99.90729412
Minimum0
Maximum165
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile67
Q187
median100
Q3113
95-th percentile133
Maximum165
Range165
Interquartile range (IQR)26

Descriptive statistics

Standard deviation19.85081731
Coefficient of variation (CV)0.1986923726
Kurtosis0.1935936484
Mean99.90729412
Median Absolute Deviation (MAD)13
Skewness-0.08581246337
Sum424606
Variance394.054948
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
105101
 
2.4%
9597
 
2.3%
11092
 
2.2%
9492
 
2.2%
11290
 
2.1%
10289
 
2.1%
9788
 
2.1%
10787
 
2.0%
10085
 
2.0%
10884
 
2.0%
Other values (110)3345
78.7%
ValueCountFrequency (%)
02
< 0.1%
301
 
< 0.1%
341
 
< 0.1%
351
 
< 0.1%
361
 
< 0.1%
402
< 0.1%
421
 
< 0.1%
444
0.1%
453
0.1%
461
 
< 0.1%
ValueCountFrequency (%)
1651
 
< 0.1%
1602
 
< 0.1%
1582
 
< 0.1%
1572
 
< 0.1%
1563
 
0.1%
1522
 
< 0.1%
1516
0.1%
1503
 
0.1%
1486
0.1%
1478
0.2%

total_day_charge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1843
Distinct (%)43.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.64468235
Minimum0
Maximum59.76
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile15.5735
Q124.365
median30.68
Q336.75
95-th percentile46.081
Maximum59.76
Range59.76
Interquartile range (IQR)12.385

Descriptive statistics

Standard deviation9.182096033
Coefficient of variation (CV)0.2996309744
Kurtosis-0.0565844345
Mean30.64468235
Median Absolute Deviation (MAD)6.225
Skewness-0.006912526228
Sum130239.9
Variance84.31088755
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32.1810
 
0.2%
30.69
 
0.2%
30.118
 
0.2%
31.378
 
0.2%
26.188
 
0.2%
31.457
 
0.2%
34.587
 
0.2%
29.587
 
0.2%
28.637
 
0.2%
28.667
 
0.2%
Other values (1833)4172
98.2%
ValueCountFrequency (%)
02
< 0.1%
0.441
< 0.1%
1.121
< 0.1%
1.221
< 0.1%
1.331
< 0.1%
1.341
< 0.1%
4.41
< 0.1%
4.591
< 0.1%
5.081
< 0.1%
5.251
< 0.1%
ValueCountFrequency (%)
59.761
< 0.1%
58.961
< 0.1%
58.71
< 0.1%
57.531
< 0.1%
57.361
< 0.1%
57.041
< 0.1%
56.831
< 0.1%
56.591
< 0.1%
56.461
< 0.1%
56.071
< 0.1%
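
The high-correlation alert for total_day_minutes and total_day_charge can be illustrated with the five largest values listed for each variable. The sketch assumes the largest charges belong to the same rows as the largest minute totals; the report lists the two tables separately, so this pairing is an assumption. Under that assumption the charge is very close to 0.17 per minute, and the Pearson coefficient is close to 1.

```python
from math import sqrt

# Five largest values from each table; pairing them by rank is an assumption.
minutes = [351.5, 346.8, 345.3, 338.4, 337.4]
charge = [59.76, 58.96, 58.7, 57.53, 57.36]


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


rates = [c / m for c, m in zip(charge, minutes)]  # implied charge per minute
r = pearson(minutes, charge)
print([round(rate, 4) for rate in rates], round(r, 5))
```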

total_eve_minutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1773
Distinct (%)41.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.1739059
Minimum0
Maximum359.3
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile118.2
Q1165.925
median200.7
Q3233.775
95-th percentile282.71
Maximum359.3
Range359.3
Interquartile range (IQR)67.85

Descriptive statistics

Standard deviation50.24951818
Coefficient of variation (CV)0.2510293135
Kurtosis0.04345320215
Mean200.1739059
Median Absolute Deviation (MAD)33.7
Skewness-0.03041458624
Sum850739.1
Variance2525.014078
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
230.910
 
0.2%
187.59
 
0.2%
1949
 
0.2%
169.99
 
0.2%
199.79
 
0.2%
2018
 
0.2%
216.58
 
0.2%
223.58
 
0.2%
209.48
 
0.2%
211.58
 
0.2%
Other values (1763)4164
98.0%
ValueCountFrequency (%)
01
< 0.1%
22.31
< 0.1%
37.81
< 0.1%
41.71
< 0.1%
42.21
< 0.1%
42.51
< 0.1%
43.91
< 0.1%
47.31
< 0.1%
48.11
< 0.1%
49.21
< 0.1%
ValueCountFrequency (%)
359.31
< 0.1%
352.11
< 0.1%
351.61
< 0.1%
349.41
< 0.1%
348.51
< 0.1%
347.31
< 0.1%
345.11
< 0.1%
344.91
< 0.1%
3441
< 0.1%
341.31
< 0.1%

total_eve_calls
Real number (ℝ≥0)

HIGH CORRELATION

Distinct123
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean100.1764706
Minimum0
Maximum170
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile67
Q187
median100
Q3114
95-th percentile133
Maximum170
Range170
Interquartile range (IQR)27

Descriptive statistics

Standard deviation19.9085911
Coefficient of variation (CV)0.1987352019
Kurtosis0.1145997215
Mean100.1764706
Median Absolute Deviation (MAD)13
Skewness-0.02081182363
Sum425750
Variance396.3519998
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10598
 
2.3%
10396
 
2.3%
9195
 
2.2%
9791
 
2.1%
9488
 
2.1%
9688
 
2.1%
10888
 
2.1%
8887
 
2.0%
10186
 
2.0%
10485
 
2.0%
Other values (113)3348
78.8%
ValueCountFrequency (%)
01
 
< 0.1%
121
 
< 0.1%
361
 
< 0.1%
381
 
< 0.1%
431
 
< 0.1%
442
 
< 0.1%
451
 
< 0.1%
465
0.1%
471
 
< 0.1%
486
0.1%
ValueCountFrequency (%)
1701
 
< 0.1%
1691
 
< 0.1%
1681
 
< 0.1%
1591
 
< 0.1%
1571
 
< 0.1%
1561
 
< 0.1%
1555
0.1%
1543
0.1%
1531
 
< 0.1%
1526
0.1%

total_eve_charge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1572
Distinct (%)37.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.01501176
Minimum0
Maximum30.54
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile10.05
Q114.1025
median17.06
Q319.8675
95-th percentile24.031
Maximum30.54
Range30.54
Interquartile range (IQR)5.765

Descriptive statistics

Standard deviation4.271211992
Coefficient of variation (CV)0.2510260969
Kurtosis0.04332949445
Mean17.01501176
Median Absolute Deviation (MAD)2.86
Skewness-0.03038789084
Sum72313.8
Variance18.24325188
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16.1213
 
0.3%
18.7913
 
0.3%
14.2513
 
0.3%
16.9712
 
0.3%
15.912
 
0.3%
18.9611
 
0.3%
16.810
 
0.2%
19.6310
 
0.2%
17.0910
 
0.2%
16.419
 
0.2%
Other values (1562)4137
97.3%
ValueCountFrequency (%)
01
< 0.1%
1.91
< 0.1%
3.211
< 0.1%
3.541
< 0.1%
3.591
< 0.1%
3.611
< 0.1%
3.731
< 0.1%
4.021
< 0.1%
4.091
< 0.1%
4.181
< 0.1%
ValueCountFrequency (%)
30.541
< 0.1%
29.931
< 0.1%
29.891
< 0.1%
29.71
< 0.1%
29.621
< 0.1%
29.521
< 0.1%
29.331
< 0.1%
29.321
< 0.1%
29.241
< 0.1%
29.011
< 0.1%

total_night_minutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1757
Distinct (%)41.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.5278824
Minimum0
Maximum395
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile118.09
Q1167.225
median200.45
Q3234.7
95-th percentile282.71
Maximum395
Range395
Interquartile range (IQR)67.475

Descriptive statistics

Standard deviation50.35354807
Coefficient of variation (CV)0.251104971
Kurtosis0.1148535776
Mean200.5278824
Median Absolute Deviation (MAD)33.55
Skewness0.008490819348
Sum852243.5
Variance2535.479804
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
186.211
 
0.3%
208.910
 
0.2%
188.28
 
0.2%
169.48
 
0.2%
193.68
 
0.2%
230.18
 
0.2%
190.58
 
0.2%
228.18
 
0.2%
214.78
 
0.2%
2148
 
0.2%
Other values (1747)4165
98.0%
ValueCountFrequency (%)
01
< 0.1%
23.21
< 0.1%
43.71
< 0.1%
451
< 0.1%
46.71
< 0.1%
47.41
< 0.1%
50.12
< 0.1%
53.31
< 0.1%
541
< 0.1%
54.51
< 0.1%
ValueCountFrequency (%)
3951
< 0.1%
381.91
< 0.1%
381.61
< 0.1%
377.51
< 0.1%
367.71
< 0.1%
364.91
< 0.1%
359.91
< 0.1%
355.11
< 0.1%
352.51
< 0.1%
352.21
< 0.1%

total_night_calls
Real number (ℝ≥0)

HIGH CORRELATION

Distinct128
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99.83952941
Minimum0
Maximum175
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile67
Q186
median100
Q3113
95-th percentile132
Maximum175
Range175
Interquartile range (IQR)27

Descriptive statistics

Standard deviation20.09321979
Coefficient of variation (CV)0.2012551532
Kurtosis0.07721835856
Mean99.83952941
Median Absolute Deviation (MAD)14
Skewness0.005273110227
Sum424318
Variance403.7374815
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
105100
 
2.4%
9992
 
2.2%
9591
 
2.1%
10290
 
2.1%
9188
 
2.1%
9488
 
2.1%
10487
 
2.0%
9887
 
2.0%
10086
 
2.0%
10985
 
2.0%
Other values (118)3356
79.0%
ValueCountFrequency (%)
01
 
< 0.1%
331
 
< 0.1%
361
 
< 0.1%
382
< 0.1%
401
 
< 0.1%
411
 
< 0.1%
424
0.1%
431
 
< 0.1%
441
 
< 0.1%
463
0.1%
ValueCountFrequency (%)
1751
< 0.1%
1701
< 0.1%
1651
< 0.1%
1641
< 0.1%
1611
< 0.1%
1601
< 0.1%
1592
< 0.1%
1582
< 0.1%
1572
< 0.1%
1562
< 0.1%

total_night_charge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct992
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.023891765
Minimum0
Maximum17.77
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile5.3145
Q17.5225
median9.02
Q310.56
95-th percentile12.7255
Maximum17.77
Range17.77
Interquartile range (IQR)3.0375

Descriptive statistics

Standard deviation2.265921811
Coefficient of variation (CV)0.2511025033
Kurtosis0.1148651735
Mean9.023891765
Median Absolute Deviation (MAD)1.51
Skewness0.008444754041
Sum38351.54
Variance5.134401655
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9.418
 
0.4%
9.6317
 
0.4%
8.1517
 
0.4%
10.817
 
0.4%
9.6616
 
0.4%
8.8215
 
0.4%
10.4915
 
0.4%
9.7615
 
0.4%
8.5714
 
0.3%
10.3514
 
0.3%
Other values (982)4092
96.3%
ValueCountFrequency (%)
01
< 0.1%
1.041
< 0.1%
1.971
< 0.1%
2.031
< 0.1%
2.11
< 0.1%
2.131
< 0.1%
2.252
< 0.1%
2.41
< 0.1%
2.431
< 0.1%
2.451
< 0.1%
ValueCountFrequency (%)
17.771
< 0.1%
17.191
< 0.1%
17.171
< 0.1%
16.991
< 0.1%
16.551
< 0.1%
16.421
< 0.1%
16.21
< 0.1%
15.981
< 0.1%
15.861
< 0.1%
15.851
< 0.1%

total_intl_minutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct168
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.25607059
Minimum0
Maximum20
Zeros22
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile5.7
Q18.5
median10.3
Q312
95-th percentile14.6
Maximum20
Range20
Interquartile range (IQR)3.5

Descriptive statistics

Standard deviation2.760101726
Coefficient of variation (CV)0.2691188309
Kurtosis0.7029511928
Mean10.25607059
Median Absolute Deviation (MAD)1.8
Skewness-0.2413595394
Sum43588.3
Variance7.618161539
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11.175
 
1.8%
9.873
 
1.7%
11.473
 
1.7%
10.272
 
1.7%
10.971
 
1.7%
11.370
 
1.6%
10.169
 
1.6%
9.768
 
1.6%
9.566
 
1.6%
10.566
 
1.6%
Other values (158)3547
83.5%
ValueCountFrequency (%)
022
0.5%
0.41
 
< 0.1%
1.12
 
< 0.1%
1.31
 
< 0.1%
22
 
< 0.1%
2.12
 
< 0.1%
2.22
 
< 0.1%
2.41
 
< 0.1%
2.51
 
< 0.1%
2.61
 
< 0.1%
ValueCountFrequency (%)
201
 
< 0.1%
19.72
< 0.1%
19.31
 
< 0.1%
19.21
 
< 0.1%
18.91
 
< 0.1%
18.51
 
< 0.1%
18.41
 
< 0.1%
18.31
 
< 0.1%
18.22
< 0.1%
183
0.1%

total_intl_calls
Real number (ℝ≥0)

HIGH CORRELATION

Distinct21
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.426352941
Minimum0
Maximum20
Zeros22
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median4
Q36
95-th percentile9
Maximum20
Range20
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.463069113
Coefficient of variation (CV)0.5564556522
Kurtosis3.263227525
Mean4.426352941
Median Absolute Deviation (MAD)1
Skewness1.360122209
Sum18812
Variance6.066709454
MonotonicityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
3847
19.9%
4795
18.7%
2644
15.2%
5598
14.1%
6408
9.6%
7272
 
6.4%
1226
 
5.3%
8153
 
3.6%
9126
 
3.0%
1059
 
1.4%
Other values (11)122
 
2.9%
ValueCountFrequency (%)
022
 
0.5%
1226
 
5.3%
2644
15.2%
3847
19.9%
4795
18.7%
5598
14.1%
6408
9.6%
7272
 
6.4%
8153
 
3.6%
9126
 
3.0%
ValueCountFrequency (%)
201
 
< 0.1%
191
 
< 0.1%
184
 
0.1%
171
 
< 0.1%
167
 
0.2%
159
 
0.2%
145
 
0.1%
1316
0.4%
1218
0.4%
1138
0.9%

total_intl_charge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct168
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.769654118
Minimum0
Maximum5.4
Zeros22
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile1.54
Q12.3
median2.78
Q33.24
95-th percentile3.94
Maximum5.4
Range5.4
Interquartile range (IQR)0.94

Descriptive statistics

Standard deviation0.7452041364
Coefficient of variation (CV)0.2690603609
Kurtosis0.7033212689
Mean2.769654118
Median Absolute Deviation (MAD)0.48
Skewness-0.2416706661
Sum11771.03
Variance0.5553292049
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
375
 
1.8%
3.0873
 
1.7%
2.6573
 
1.7%
2.7572
 
1.7%
2.9471
 
1.7%
3.0570
 
1.6%
2.7369
 
1.6%
2.6268
 
1.6%
2.8466
 
1.6%
2.5766
 
1.6%
Other values (158)3547
83.5%
ValueCountFrequency (%)
022
0.5%
0.111
 
< 0.1%
0.32
 
< 0.1%
0.351
 
< 0.1%
0.542
 
< 0.1%
0.572
 
< 0.1%
0.592
 
< 0.1%
0.651
 
< 0.1%
0.681
 
< 0.1%
0.71
 
< 0.1%
ValueCountFrequency (%)
5.41
 
< 0.1%
5.322
< 0.1%
5.211
 
< 0.1%
5.181
 
< 0.1%
5.11
 
< 0.1%
51
 
< 0.1%
4.971
 
< 0.1%
4.941
 
< 0.1%
4.912
< 0.1%
4.863
0.1%

number_customer_service_calls
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.559058824
Minimum0
Maximum9
Zeros886
Zeros (%)20.8%
Negative0
Negative (%)0.0%
Memory size33.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q32
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.31143353
Coefficient of variation (CV)0.8411700126
Kurtosis1.655618759
Mean1.559058824
Median Absolute Deviation (MAD)1
Skewness1.082691586
Sum6626
Variance1.719857904
MonotonicityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
11524
35.9%
2947
22.3%
0886
20.8%
3558
 
13.1%
4209
 
4.9%
581
 
1.9%
628
 
0.7%
713
 
0.3%
92
 
< 0.1%
82
 
< 0.1%
ValueCountFrequency (%)
0886
20.8%
11524
35.9%
2947
22.3%
3558
 
13.1%
4209
 
4.9%
581
 
1.9%
628
 
0.7%
713
 
0.3%
82
 
< 0.1%
92
 
< 0.1%
ValueCountFrequency (%)
92
 
< 0.1%
82
 
< 0.1%
713
 
0.3%
628
 
0.7%
581
 
1.9%
4209
 
4.9%
3558
 
13.1%
2947
22.3%
11524
35.9%
0886
20.8%

churn
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      3652
1      598

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

Most occurring characters

ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4250
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

Most occurring scripts

ValueCountFrequency (%)
Common4250
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII4250
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
03652
85.9%
1598
 
14.1%

area_code_area_code_408
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      3164
1      1086

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

Most occurring characters

ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4250
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

Most occurring scripts

ValueCountFrequency (%)
Common4250
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII4250
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
03164
74.4%
11086
 
25.6%

area_code_area_code_415
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      2142
1      2108

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row1
2nd row1
3rd row0
4th row1
5th row0

Common Values

ValueCountFrequency (%)
02142
50.4%
12108
49.6%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
02142
50.4%
12108
49.6%

Most occurring characters

ValueCountFrequency (%)
02142
50.4%
12108
49.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4250
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
02142
50.4%
12108
49.6%

Most occurring scripts

ValueCountFrequency (%)
Common4250
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
02142
50.4%
12108
49.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII4250
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02142
50.4%
12108
49.6%

area_code_area_code_510
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.3 KiB

Value  Count
0      3194
1      1056

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4250
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

Length

Histogram of lengths of the category

Pie chart

Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

Most occurring characters

Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  4250   100.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

Most occurring scripts

Value   Count  Frequency (%)
Common  4250   100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  4250   100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      3194   75.2%
1      1056   24.8%

TotalDaychargePerCall
Real number (ℝ≥0)

HIGH CORRELATION

Distinct      3916
Distinct (%)  92.1%
Missing       0
Missing (%)   0.0%
Infinite      0
Infinite (%)  0.0%
Mean          16.98454527
Minimum       0
Maximum       28.05213731
Zeros         2
Zeros (%)     < 0.1%
Negative      0
Negative (%)  0.0%
Memory size   33.3 KiB

Quantile statistics

Minimum                    0
5-th percentile            11.39
Q1                         14.78934564
median                     17.00058888
Q3                         19.21073371
95-th percentile           22.60959091
Maximum                    28.05213731
Range                      28.05213731
Interquartile range (IQR)  4.421388077

Descriptive statistics

Standard deviation               3.374701333
Coefficient of variation (CV)    0.1986924747
Kurtosis                         0.1935961754
Mean                             16.98454527
Median Absolute Deviation (MAD)  2.210602988
Skewness                         -0.08579599779
Sum                              72184.31741
Variance                         11.38860908
Monotonicity                     Not monotonic
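
Several of the figures reported above for TotalDaychargePerCall are tied together by simple identities: the variance is the square of the standard deviation, the CV is the standard deviation divided by the mean, the IQR is Q3 minus Q1, and the sum is the mean times the number of observations. The following is a minimal sketch that recomputes them from the values printed in this report; the variable names are illustrative, and the results should agree with the reported figures only up to rounding of the displayed digits.

```python
# Figures copied from the TotalDaychargePerCall summary in this report.
n_obs = 4250                      # number of observations in the dataset
mean = 16.98454527
std_dev = 3.374701333
q1, q3 = 14.78934564, 19.21073371

variance = std_dev ** 2           # compare with the reported Variance
cv = std_dev / mean               # compare with the reported CV
iqr = q3 - q1                     # compare with the reported IQR
total = mean * n_obs              # compare with the reported Sum

print(f"variance = {variance:.6f}")
print(f"cv       = {cv:.6f}")
print(f"iqr      = {iqr:.6f}")
print(f"sum      = {total:.4f}")
```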
Histogram with fixed size bins (bins=50)
Value                  Count  Frequency (%)
17                     11     0.3%
17.17                  11     0.3%
17.34                  10     0.2%
16.15                  9      0.2%
18.7                   9      0.2%
19.04                  7      0.2%
20.91                  7      0.2%
17.68                  7      0.2%
13.26                  6      0.1%
20.23                  6      0.1%
Other values (3906)    4167   98.0%

Value        Count  Frequency (%)
0            2      < 0.1%
5.10013844   1      < 0.1%
5.780250553  1      < 0.1%
5.951211073  1      < 0.1%
6.120775027  1      < 0.1%
6.799433695  1      < 0.1%
6.799653079  1      < 0.1%
7.140953823  1      < 0.1%
7.479308539  1      < 0.1%
7.480154008  1      < 0.1%

Value        Count  Frequency (%)
28.05213731  1      < 0.1%
27.20242302  1      < 0.1%
27.19651922  1      < 0.1%
26.86523179  1      < 0.1%
26.86        1      < 0.1%
26.69147834  1      < 0.1%
26.68917063  1      < 0.1%
26.52543554  1      < 0.1%
26.52        2      < 0.1%
25.84434907  1      < 0.1%

TotalNightchargePercall
Real number (ℝ≥0)

HIGH CORRELATION

Distinct      4058
Distinct (%)  95.5%
Missing       0
Missing (%)   0.0%
Infinite      0
Infinite (%)  0.0%
Mean          4.492846656
Minimum       0
Maximum       7.873006834
Zeros         1
Zeros (%)     < 0.1%
Negative      0
Negative (%)  0.0%
Memory size   33.3 KiB

Quantile statistics

Minimum                    0
5-th percentile            3.015221015
Q1                         3.870488103
median                     4.49959968
Q3                         5.086778288
95-th percentile           5.942465752
Maximum                    7.873006834
Range                      7.873006834
Interquartile range (IQR)  1.216290185

Descriptive statistics

Standard deviation               0.9042044338
Coefficient of variation (CV)    0.2012542388
Kurtosis                         0.07775190429
Mean                             4.492846656
Median Absolute Deviation (MAD)  0.6282930397
Skewness                         0.005307809384
Sum                              19094.59829
Variance                         0.8175856582
Monotonicity                     Not monotonic
Histogram with fixed size bins (bins=50)
Value                  Count  Frequency (%)
5.04                   5      0.1%
4.005                  5      0.1%
4.725                  4      0.1%
4.995                  4      0.1%
4.23                   4      0.1%
5.31                   4      0.1%
4.95                   4      0.1%
4.455                  4      0.1%
3.735                  4      0.1%
4.32                   4      0.1%
Other values (4048)    4208   99.0%

Value        Count  Frequency (%)
0            1      < 0.1%
1.485070183  1      < 0.1%
1.620323741  1      < 0.1%
1.709355308  1      < 0.1%
1.71046683   1      < 0.1%
1.799291617  1      < 0.1%
1.84476678   1      < 0.1%
1.889219331  1      < 0.1%
1.889361702  1      < 0.1%
1.889596929  1      < 0.1%

Value        Count  Frequency (%)
7.873006834  1      < 0.1%
7.653640257  1      < 0.1%
7.427306617  1      < 0.1%
7.383886256  1      < 0.1%
7.243430799  1      < 0.1%
7.197640118  1      < 0.1%
7.157250708  1      < 0.1%
7.153304904  1      < 0.1%
7.110559887  1      < 0.1%
7.108702791  1      < 0.1%

TotalEvechargePerCall
Real number (ℝ≥0)

HIGH CORRELATION

Distinct      4037
Distinct (%)  95.0%
Missing       0
Missing (%)   0.0%
Infinite      0
Infinite (%)  0.0%
Mean          8.51513693
Minimum       0
Maximum       14.45102163
Zeros         1
Zeros (%)     < 0.1%
Negative      0
Negative (%)  0.0%
Memory size   33.3 KiB

Quantile statistics

Minimum                    0
5-th percentile            5.696086376
Q1                         7.395447515
median                     8.502036136
Q3                         9.688243691
95-th percentile           11.30606308
Maximum                    14.45102163
Range                      14.45102163
Interquartile range (IQR)  2.292796176

Descriptive statistics

Standard deviation               1.692302092
Coefficient of variation (CV)    0.1987404438
Kurtosis                         0.1147777135
Mean                             8.51513693
Median Absolute Deviation (MAD)  1.107319713
Skewness                         -0.02071057376
Sum                              36189.33195
Variance                         2.863886371
Monotonicity                     Not monotonic
Histogram with fixed size bins (bins=50)
Value                  Count  Frequency (%)
8.33                   8      0.2%
9.435                  6      0.1%
9.945                  6      0.1%
7.225                  5      0.1%
9.52                   5      0.1%
9.01                   5      0.1%
9.86                   5      0.1%
9.69                   5      0.1%
7.82                   5      0.1%
8.67                   5      0.1%
Other values (4027)    4195   98.7%

Value        Count  Frequency (%)
0            1      < 0.1%
1.019967966  1      < 0.1%
3.059408728  1      < 0.1%
3.230100689  1      < 0.1%
3.654891029  1      < 0.1%
3.739343719  1      < 0.1%
3.739833711  1      < 0.1%
3.825411335  1      < 0.1%
3.908651026  1      < 0.1%
3.909282371  1      < 0.1%

Value        Count  Frequency (%)
14.45102163  1      < 0.1%
14.36933333  1      < 0.1%
14.28126888  1      < 0.1%
13.51721448  1      < 0.1%
13.34298201  1      < 0.1%
13.26521739  1      < 0.1%
13.18004338  1      < 0.1%
13.17673378  1      < 0.1%
13.17338877  1      < 0.1%
13.17213115  1      < 0.1%

TotalIntnlchargePerCall
Real number (ℝ≥0)

HIGH CORRELATION

Distinct      870
Distinct (%)  20.5%
Missing       0
Missing (%)   0.0%
Infinite      0
Infinite (%)  0.0%
Mean          1.195385027
Minimum       0
Maximum       5.395683453
Zeros         22
Zeros (%)     0.5%
Negative      0
Negative (%)  0.0%
Memory size   33.3 KiB

Quantile statistics

Minimum                    0
5-th percentile            0.2704
Q1                         0.8093023256
median                     1.080291971
Q3                         1.618032787
95-th percentile           2.431343284
Maximum                    5.395683453
Range                      5.395683453
Interquartile range (IQR)  0.8087304613

Descriptive statistics

Standard deviation               0.6652220812
Coefficient of variation (CV)    0.5564918968
Kurtosis                         3.265792205
Mean                             1.195385027
Median Absolute Deviation (MAD)  0.2717553854
Skewness                         1.360520976
Sum                              5080.386366
Variance                         0.4425204173
Monotonicity                     Not monotonic
Histogram with fixed size bins (bins=50)
Value                 Count  Frequency (%)
0.81                  79     1.9%
1.35                  54     1.3%
1.08                  52     1.2%
0.54                  48     1.1%
1.62                  40     0.9%
0.8111111111          28     0.7%
0.5407407407          24     0.6%
1.081481481           24     0.6%
1.079069767           22     0.5%
1.080851064           22     0.5%
Other values (860)    3857   90.8%

Value         Count  Frequency (%)
0             22     0.5%
0.269047619   1      < 0.1%
0.2693548387  1      < 0.1%
0.2694444444  1      < 0.1%
0.2694915254  1      < 0.1%
0.2695121951  1      < 0.1%
0.2695652174  4      0.1%
0.2696078431  5      0.1%
0.2696202532  5      0.1%
0.2696428571  4      0.1%

Value        Count  Frequency (%)
5.395683453  1      < 0.1%
5.14         1      < 0.1%
4.862686567  1      < 0.1%
4.86         1      < 0.1%
4.86         1      < 0.1%
4.855813953  1      < 0.1%
4.5875       1      < 0.1%
4.330666667  1      < 0.1%
4.326956522  1      < 0.1%
4.325925926  2      < 0.1%

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
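
The calculation just described can be sketched in plain Python as follows. This is a minimal illustration, not the report generator's own code: the helper names (`average_ranks`, `pearson`, `spearman_rho`) and the toy data are assumptions, and ties are handled by average ranks, which is one common convention.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data (hypothetical): y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
print(spearman_rho(x, y), pearson(x, y))
```

On this toy data ρ is 1 up to floating-point error, while Pearson's r on the raw values stays below 1, which is the sense in which ρ catches nonlinear monotonic relationships.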

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect the magnitude of r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
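
As a hedged illustration of that formula, the sketch below computes r in plain Python. The function name `pearson_r` is illustrative. The five value pairs are total_day_minutes and total_day_charge as read from the first rows of the sample at the end of this report; because the columns are fused in that listing, the split is an assumption. The second call rescales one variable with a positive factor to show the location and scale invariance mentioned above.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# Assumed reading of the first five sample rows.
minutes = [161.6, 243.4, 299.4, 166.7, 218.2]
charge = [27.47, 41.38, 50.90, 28.34, 37.09]

r = pearson_r(minutes, charge)

# Shifting and positively rescaling one variable leaves r unchanged.
rescaled = [2.0 * m + 5.0 for m in minutes]
r_rescaled = pearson_r(rescaled, charge)

print(r, r_rescaled)
```

A value of r very close to 1 for these pairs is consistent with the high-correlation alert between total_day_minutes and total_day_charge.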

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
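
The pair-counting definition can be sketched directly. This is the simple variant without tie correction (often called tau-a); library implementations commonly apply a tie-adjusted variant instead, so results may differ when the data contain ties. The function name and the toy data are assumptions made for illustration.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, with no correction for ties."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Toy data (hypothetical): one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.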

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
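
A minimal sketch of the bias-corrected estimator follows, under stated assumptions: the chi-squared statistic is computed without a continuity correction, and the function name `cramers_v_corrected` is illustrative rather than the report generator's API. The example table crosses area_code_area_code_415 with area_code_area_code_510. Its cell counts are inferred from the marginal counts in this report (1086, 2108 and 1056 ones, summing to 4250) on the assumption that the three area-code indicators are one-hot encodings of a single field, so no row has both set to 1.

```python
from math import sqrt


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V (Bergsma, 2013) for a table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic, no continuity correction (assumption).
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Rows: area_code_415 == 0, == 1. Columns: area_code_510 == 0, == 1.
# Cell counts are inferred from the marginals under the one-hot assumption.
table = [
    [1086, 1056],
    [2108, 0],
]
v = cramers_v_corrected(table)
print(v)
```

The correction matters most for small samples; with 4250 observations it changes the estimate only slightly.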

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
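
To make the two views concrete, here is a small sketch of how per-column nullity counts and a text nullity matrix could be derived from a list of records. The records are hypothetical and contain missing values on purpose; the dataset profiled in this report has no missing cells. The function names are illustrative.

```python
def nullity_by_column(rows):
    """Count missing (None) cells per column for a list of dict records."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts


def nullity_matrix(rows, columns):
    """One string per row: '#' for a present cell, '.' for a missing one."""
    return ["".join("." if row[c] is None else "#" for c in columns) for row in rows]


# Hypothetical records, not taken from the profiled dataset.
records = [
    {"state": 35, "account_length": 107, "churn": 0},
    {"state": 31, "account_length": None, "churn": 0},
    {"state": None, "account_length": 84, "churn": 0},
]

counts = nullity_by_column(records)
matrix = nullity_matrix(records, ["state", "account_length", "churn"])
print(counts)
print("\n".join(matrix))
```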

Sample

First rows

state  account_length  international_plan  voice_mail_plan  total_day_minutes  total_day_calls  total_day_charge  total_eve_minutes  total_eve_calls  total_eve_charge  total_night_minutes  total_night_calls  total_night_charge  total_intl_minutes  total_intl_calls  total_intl_charge  number_customer_service_calls  churn  area_code_area_code_408  area_code_area_code_415  area_code_area_code_510  TotalDaychargePerCall  TotalNightchargePercall  TotalEvechargePerCall  TotalIntnlchargePerCall
03510701161.612327.47195.510316.62254.410311.4513.733.701001020.9084784.6358108.7563170.810219
13113700243.411441.38121.211010.30162.61047.3212.253.290001019.3809374.6819199.3481851.348361
2358410299.47150.9061.9885.26196.9898.866.671.782010012.0704744.0047747.4778681.887879
3367510166.711328.34148.312212.61186.91218.4110.132.733001019.2106785.44467610.3737020.810891
41912101218.28837.09348.510829.62212.61189.577.572.033000114.9583875.3116659.1792251.894667
52414710157.07926.69103.1948.76211.8969.537.161.920001013.4300004.3195477.9868091.622535
61811700184.59731.37351.68029.89215.8909.718.742.351010016.4926294.0495836.8009101.080460
74914111258.68443.96222.011118.87326.49714.6911.253.020001014.2793504.3655949.4350001.348214
8156500129.113721.95228.58319.42208.81119.4012.763.434101023.2931844.9971267.0540921.620472
9397400187.712731.91163.414813.89196.0948.829.152.460001021.5906774.23000012.5809061.351648

Last rows

state  account_length  international_plan  voice_mail_plan  total_day_minutes  total_day_calls  total_day_charge  total_eve_minutes  total_eve_calls  total_eve_charge  total_night_minutes  total_night_calls  total_night_charge  total_intl_minutes  total_intl_calls  total_intl_charge  number_customer_service_calls  churn  area_code_area_code_408  area_code_area_code_415  area_code_area_code_510  TotalDaychargePerCall  TotalNightchargePercall  TotalEvechargePerCall  TotalIntnlchargePerCall
4240212701157.610726.79280.64923.8575.1773.388.042.161001018.1886423.4655134.1648251.080000
4241478000157.010126.69208.812717.75113.31095.1016.224.372000117.1700004.90644310.7962160.539506
42422315000170.011528.90162.713813.83267.27712.028.322.240010019.5500003.46384711.7304240.539759
42432814000244.711541.60258.610121.98231.311210.417.562.031100119.5504705.0407268.5846091.624000
424439700252.68942.94340.39128.93256.56711.548.852.381100115.1292953.0143477.7362031.352273
4245268300188.37032.01243.88820.72213.7799.6210.362.780001011.8996283.5562947.4789171.619417
4246497300177.98930.24131.28211.15186.2898.3811.563.113010015.1284994.0054786.9687501.622609
4247277500170.710129.02193.112616.41129.11045.816.971.861010017.1705924.68040310.7077161.886957
4248115001235.712740.07223.012618.96297.511613.399.952.672010021.5905395.22097510.7128251.348485
4249468601129.410222.00267.110422.70154.81006.979.3162.510001017.3415774.5025848.8386374.318280